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® Method for the expression of heterologous proteins produced in fused form in E. coll, use thereof, 
expression vectors and recombinant strains, i 

^ @ The present invention relates to the field of biotechnology and in particular the use of recombmant DNA 

technology for the production of heterologous proteins. 
CO The technical object thereof is to develop a highly efficient method for the expression of heterologous genes 
^ In fused form in E . coli . which code for proteins which can easily be purified owing to the fact that they are 

synthesized in ins"olubie~form in the cellular cytoplasm. 
^ To achieve this, an expression vector is used which contains a stabiliser sequence which codes only for the 
O first 58 amino acids belonging to the N-terminat end of the human protein interleukine-2, which Is under the 

tryptophan promoter of the actual E . coli . This vector further contains the gene for resistance to ampiclllin as a 
R; selection marker and the terminator of transcript ion of bacteriophage T4. In particular the genes which code for 

the antigenic proteins of the human immunodeficiency virus (HIV 1 and 2) were cloned therein, high levels of 

expression of said proteins being obtained from transformed strains of the bacteria Escherichia coli . 
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The method which is the subject of the present invention can be employed for the expression at high 1 vels 
of recombinant heterologous proteins synthesized in fused and insoluble form in E . coli , which can be used in 
the pharmaceutical industry to obtain vaccine preparations or in the development of diagnostic systems, in the 
food industry, in agriculture, etc. 



2 



EP 0 416 673 A1 

METHOD FOR THE EXPRESSION OF HETEROLOGOUS PROTEINS PRODUCED IN FUSED FORM IN E . CPU 
p USE THEREOF, EXPRESSION VECTORS AND RECOMBINANT STRAINS 

The present invention relates to the field of biotechnology and recombinant DNA techniques and in 
particular to a method for the expression of heterologous proteins synthesized in fused and insoluble form 
from recombinant E . col[ bacteria. 

The utility of recombinant DNA technology for producing proteins of interest, of any origin m E . coli , 
5 has been extensively demonstrated. For this, a large number of vectors have been developed, although new 
variants are still necessary owing to the fact that each gene to be cloned and expressed represents an 
individual case (Denhardt. D.T. and Colasanti. J.. Vectors. Butterworths, Stoneham. MA. pp. 179-204. 1987 
and Lukacsovich, T. et al.. Journal of Biotechnology. 13. 243-250. 1990). 

Many eukaryotic polypeptides of clinical or industrial interest, the natural availability of which Is scarce, 
have been obtained by cloning and expression of the genes which code for them in Escherichia coli . 

An important problem associated with the production of recombinant proteins in microorganisms is 
degradation of the product by the host system's own proteases. The stability of the protein can be 
influenced by different factors such as location of the gene product (Talmadge K. and Gilbert W., Proc. Natl 
Acad. Sci. USA 79. 1830-1833. 1982: Moks T. et al.. Biochemistry 26. 5239-5244. 1987). selection of the 
host strain (Bueli et al., Nucleic Acids Res. 13. 1923-1938. 1985: Bishai W.R. et al.. J. Bacteriol. 169. 
5140-5151. 1987; Grodberg J. and Dunn. J.J.. Bacteriol. 170. 1245-1253. 1988) as well as the conditions of 
subsequent cultivation and purification (Kitano. K. et al.. J. Biotechnol. 5. 77-86, 1987). 

Eukaryotic genes cloned in phase with bacterial or synthetic nucleic acid sequences can be expressed 
as hybrid products in the cellular cytoplasm. Transcription from bacterial promoters as well as translation 
thereof yields fusion proteins which include bacterial or synthetic polypeptide sequences in addition to the 
eukaryotic polypeptides (Marston, F.A.O., Biochem. J. 240. 1-12. 1986). 

Intracellular synthesis of a fusion protein by expression of a heterologous gene of interest fused to a 
well-expressed host gene, is a valid means of obtaining high levels of expression of a heterologous protein 
as well as an increase in stability of the product obtained (Itakura. K. et al.. Science. 198. 1056-1063, 1977). 

One of the systems used more for this purpose has been to obtain proteins fused to the beta- 
galactosidase from E . coli (Itakura. K. et al.. Science, 198. 1056-1063, 1977). However, the main 
disadvantage of this"system is the large size of this protein, on account of which the desired peptide 
represents only a small portion of the total hybrid protein (Flores, N. et al.. Appl. Microbiol. Biotechnol. 25. 
267-271, 1986; Goeddel. D.V. et al., PNAS USA. 76: 106-110). 

German patent no. 35 41 856 Al (Hoechst AG) reports on the possibility of using a stabiliser peptide 
consisting of at least the first 95 amino acids of the N-terminal end of the human protein interieukine-2 to 
obtain fusion proteins in insoluble fomn synthesized in E . coli . with a view to expressing eukaryotic 
peptides such as proinsulin and hirudine. without reference to the levels of expression reached with this 
system. In this patent are also included in the genetic construction particular sequences for cleavage of the 
35 end product with a view to separating the protein of Interest from the stabiliser peptide. 

The production of viral proteins by genetic engineering is of great Interest for the development of 
methods of diagnosis and vaccine preparations, above all because of the purity of the resulting products as 
well as the elimination of manipulation of the active pathogenic agent. In the field of diagnosis, these 
products are of great importance In early detection of antibodies to these, organisms, high specificity and 
40 sensitivity in said systems being achieved. 

In particular, in the case of human retroviruses, it is necessary to develop highly sensitive systems for 
the detection of antibodies on the basis of very pure antigens, avoiding any loss of specificity which would 
invalidate the use of them. These organisms cause various immunological changes, depending on the 
particular subgroup to which the viral agent belongs, and also due to Its trophism for T-lymphocyte cells. 
46 being able to cause abnormal proliferation or impaired functionality of said cells (leukaemia) or a depletion 
of the cell population (immunosuppression) (Wong-Staal. F. and Gallo. R.C., Nature. 317. 395-403. 1985). 

It is ttieretore necessary to count on efficient systems of expression of the main proteins with antigenic 
activity belonging to the viruses which cause these diseases, with a view to using them in rapid and precise 
diagnostic systems, which will make it possible to carry out large-scale epidemiological studies for the 
so detection of antibodies to these viruses during processing of blood samples in banks and thus to prevent 
the disease from being transmitted by this pathway. 

The genes which code for the main proteins with antigenic activity of human immunodeficiency viruses 
(HIV) have been cloned and expressed in E . coli , both directly and fused to other genes belonging to said 
host. 
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Among th proteins expressed in their natural form are peptide 121 of AIDS, which Is obtained in 
insoluble form with levels of expression varying between 5 and 10% of total proteins (Chang, T.W. et al.. 
Biotechnology 3. 905-909. 1985) and protein gag 24 of the same virus which is obtained in soluble form at 
levels not calculated (Dowbenko. DJ. et al.. Proc. Natl. Acad. Sci. USA. 82. 7748-7752. 1985). 

s In Spanish patent no. 2 000 859 (Syntex) is d scribed a method for the expression of fusion prot ins 

using a vector which contains a DNA gene of the protein TrpLE of E . coli . in which is specifically inserted 
a DNA sequence of the AIDS virus. In this case, the carboxy-tenminal LE region is substituted by a 
heterologous polypeptide, as a result of which a self-aggregating fusion protein is obtained, purification 
thereof being simplified in this way. Moreover the vector used contains binding means for three reading 

10 frames which facilitates isolation of the protein of interest in this patent Is described the construction of a 
clone of high expression which produces more than 5% of the total cell protein. 

The present invention relates to a method for the expression of heterologous proteins produced in 
insoluble form in E . coli and in particular fusion proteins which contain a fragment or the whole of a viral 
protein such as the caie of antigenic proteins belonging to human immunodeficiency vims (HIV 1 and 2). 

75 For this, there was used a vector which contains a stabiliser sequence wich codes for approximately the 
first 58 amino acids of the N-tenminal end of human interleukine-2 (iL-2). which guarantees high levels of 
expression of the heterologous genes cloned. This vector further consists of the tryptophan promoter of E . 
coli (ptrp). the gene for resistance to ampicillin as a selection marker, the terminator of transcription of 
bacteriophage T4 and restriction sites )^ I. )Cto I and Bam HI for coupling of the genes which it is desired 

20 to express. The present invention therefore also relates to the expression vectors used for cloning and 
expression of the different antigenic proteins of HIV 1 and 2 in E . coN as well as the recombinant strains 
obtained, which express levels of said heterologous proteins varying between 20 and 25% of the total 
proteins produced by them. 

In particular, the proteins expressed were the one belonging to tiie nucleus (gag24) and a fragment of 

25 the coat protein (gp41) of virus HIV 1 and a fragment of the transmembraneous protein gp36 of HIV 2. The 
strains used as hosts for cloning of the genes which code for these proteins were E . coli K-12 HB-101, W- 
3110 and C-600 respectively. 

An innovating feature of the present invention is ttie use of a stabiliser sequence, which consists of a 
fragment of the N-terminal end of the gene of human interleukine-2 protein which codes only for the first 68 

30 amino acids of said protein, which is used for the expression of heterologous proteins and in particular the 
main proteins with antigenic activity of tiie HIV virus. 

The fusion proteins expressed by means of tiie method described are synthesized in insoluble form, 
which simplifies the final purification process and makes it more efficient, on account of which proteins 
which display antigenic activity are obtained, which are used in diagnostic methods for the detection of 

35 antibodies to them without ttie need for cleavage of the stabiliser fragment used in the fusion, tine present 
invention also relating to use of the fusion protein obtained. 



EXAMPLES 



Example 1 

For the expression of different heterologous proteins in E . coli . there was constructed the expression 
45 vector pFP-15, in which was inserted the sequence which codes for a stabiliser peptide, consisting of the 
first 58 amino acids belonging to the N-terminal fragment of the prc>tein of human origin, Interleukine-2 (IL-2) 
. Said sequence is cloned under the control of the tryptophan promoter of E . coli . said vector further 
comprising the terminator of bacteriophage T4 as a signal of termination of transcription and the gene for 
ampicillin resistance as a selection marker. 
50 The plasmid vector pFP-15 was constructed by ligation of a synthetic oligonucleotide of 190 bases and 
its complementary one. which contains the sequence which codes for the first 58 amino acids of the N- 
terminal end of IL-2 (Fig. 1). tiie stabiliser sequence, and the vector pTPV-1 (Rg. 2) which carries the 
tryptophan promoter of Escherichia coli and the terminator of bacteriophage T4. The layout of said 
construction is shown in Fig. 2. 
55 Coupling of tiie DNA segment which codes for the above stabiliser peptide was verified by DNA 
sequence analysis according to tiie description in the literature (Sanger, F. et al., PNAS, USA. 74. 5463- 
5467, 1977) using an oligonucleotide (Rg. 3) which hybridises witii tiie ptrp promoter and sequence in the 
direction of the stabiliser {5-3 ). Thus it was possible to check that in all cases the appropriate reading 
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frame was maintained. 



Example 2 

5 

For cloning and expression of the nuclear protein of virus HIV-1 (gag24). the following oligonucleotides 
were designed: 

5' CAT CTA GAC ATQ CAA ATQ TTA AAA GAA 3 
3 GT TTA GGT CGA TTG ACT ATC CTA GGC 5 

TO These oligonucleotides correspond to the 6 and 3' ends respectively of the gene which codes for protein 
gag24 (Alizon, M. et al.. Nature 312, 757-760. 1984). With these oligonucleotides and with the genome of 
HIV-1 isolated, amplification was carried out by the technique of the polymerase chain reaction (PGR) 
(Randall. K. et al.. Science, USA. 239. 487-491, t988) of the gene which codes for a fragment of the gag24 
gene. This fragment was cut at sites Xba I and Bam HI. which were contained in the oligonucleotides used 

15 in the PCR. and was ligated to the expression vector pFP-15, )^ l-Bam HI being digested, the amplified 
gene being thus ligated to the segment which codes for the stabiliser peptide, under the tryptophan 
promoter. The recombinant plasmid obtained, called VIHCA (Rg. 4), was transformed in cells of E . coll 
strain K-12 HB-101. The transformed colonies were selected for ampicillln resistance in dishes of Luria 
broth medium (Miller, J.H.. Cold Spring Harbor Lab., 1972) supplemented with the antibiotic at 50 ug/ml 

20 final concentration, and the recombinants were identified by the technique of hybridisation, using as a 
radioactive probe (labeled with ^^p) the actual amplified fragment used for cloning. An immunoidentification 
test was carried out on the positive ones in autoradiography, with serum of infected patients and ^^^i labeled 
protein A, expression of the protein gag24 being identified by the positivity of these clones in the 
immunological technique. On these individuals was carried out the Western blot technique (Burnette, W.N., 

25 Anal. Bloch.. 112. 195-203, 1981), a band of approximately 28.000 daltons being obtained, which cor- 
responds to the length of the stabiliser peptide (approximately 58 amino acids) plus the fragment of the 
cloned protein gag24 (approximately 180 amino acids). 

30 Example 3 

For cloning and expression of the transmembraneous protein gp41, first of all the synthesis of an 
oligonucleotide of 269 bases and its complementary one (Rg. 5) was carried out, corresponding to a 
fragment of said protein belonging to the coat of the virus (Han. B.H. et al,, Nature 312, 166-169, 1984). 

35 This oligonucleotide was digested with Bam HI and ligated to the vector pPF-15 previously cut with Xba I, 
treated with SI nuclease and finally digested by Bam HI, the desired gene remaining fused to the segment 
which codes for the stabiliser peptide, under the tryptophan promoter of said vector. The product of this 
ligation is the vector VIHTA-1 (Rg. 6) with which E . coH strain K-12 W-3110 was transformed. The 
transformed colonies were selected for ampicillin resistance, and the recombinants were identified by the 

40 technique of hybridisation, an immunoidentification test being performed on the positive ones as in the 
preceding example. Western blot was carried out on one of the individuals which showed expression of the 
fused protein, a band of approximately 15.000 daltons being obtained, which corresponds to the expected 
size of the fusion protein, which includes the 58 amino acids of the stabiliser peptide plus 83 amino acids 
corresponding to the fragment of protein gp41 of HIV-1. 

45 

Example 4 

Cloning was carried out of the region representing the gene which codes for expression of the 
50 transmembraneous protein of HIV-2. gp36, by the synthesis of an oligonucleotide of 318 bp and its 
complementary one (Fig. 7) corresponding to a fragment of the protein gp36 of the coat of HIV-2 (Clavel, F. 
et al.. Science 233. 343-346). This DNA segment was ligated to the vector pPF-15 previously cut by Xba 1/ 
Bam HI. the desired fragment remaining fused to the gene which codes for the stabiliser peptide, under the 
tryptophan promoter of said vector. The product of this ligation is the vector VIHTA-2 (Rg. 8). which was 
55 inserted in E . coli strain K-12 C-600. The transformed colonies were selected for ampicillin resistance, and 
the recombTnante" wer identified by the technique of hybridisation, an immunoidentification test b ing 
performed on th positive ones as in xamples 2 and 3. 

In all cases, coupling of the DNA segments to the stabiliser was verified by DNA sequence analysis, as 
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reported in the Irterature (Sanger. F. et aJ., PNAS. USA. 74, 5463-5467. 1977). 



Example 5 

5 

In the case of the fusion proteins gag24-stabHiser peptide and gp41 -stabiliser peptide, the respective 
recombinant strains HB24 and W41 were grown in super broth medium (32 g tryptone and 20 g yeast 
extract per litre of distilled water) supplemented with FeCb (0.001 mM). MgS04 (0.1 mM). M9 salts 
(Na2HP0t at 6%. KH2PO4 at 3%. NaCI at 0.5% and NHdCI at 1%) and ampiciilin 50 ug/ml. 

70 For the protein gp36-stabiliser peptide, the transformed strain 03B was grown on minimal medium 
(Maniatis et ai.. Cold Spring Harbor Lab.. USA. 1982) supplemented with casein hydroiysate at 2%, glucose 
at 2%. 1 mM MgSO*. 0.1 mM CaClz and ampiciliin at the same concentration as in the previous case. 

Inoculation of the cultures was carried out at an optical density of 0.05. maintaining them at 37 C for 12 
hours, with agitation at 260 rpm. aeration at 1 vvm. finally reaching an optical density of 10 read at 600 nm. 

15 induced by depletion of the tryptophan by the addition of indoleacryiic add (Squires, C.L. et al.. Jour, of 
Mol. Biol., USA, 92. 93-111. 1975) two hours after the start of fermentation. The cells obtained are collected 
by centrituging and stored at -20 * C to be used subsequently in recovery of the desired product. After 
ultrasonic rupture of the biomass. levels of expression of 20-25% of the total protein are ascertained by 
SDS-PAGE electrophoresis of proteins (Laemmli. Nature, UK, 227, 680-685. 1070) and analysis of the series 

20 on a SCANNER 65 300. USA. 



STRAIN DEPOSITS 

25 The E . coli HB24 [pVIHCA] strain, based on the E . coli strain K-12 HB-101 and containing the plasmid 
pVIHCA."waF^eposited on July 11. 1990. with the Centraalbureau voor Schimmelcultures (CBS), Baarn. 
The Netherlands, and obtained deposit number CBS... .90. 

The E . coli W41 [pVIHTA-1] strain, based on the E . coli strain K-12 W-3110 and containing the 
plasmid pVIHTA^I. was deposited on July 11. 1990. with the Centraalbureau voor Schimmelcultures (CBS). 
30 Baam. The Netherlands, and obtained deposit number CBS....90. 

The E . coli C36 [pVIHTA-2] strain, based on the E . coli strain K-12 C-600 and containing the plasmid 
pVIHTA-S waTdeposited on July 11. 1990. with the Centraalbureau voor Schimmelcultures (CBS). Baarn. 
The Netherlands, and obtained deposit number CBS....90. 
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f^pcpmnce T.istina 



SEQUENCE TYPE: Nucleotide with corresponding protein 

SEQUENCE LENGTH: 194 base pairs 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: genomic DNA 
'B ORIGINAL SOURCE ORGANISM: Human interleukin 2 

IMMEDIATE EXPERIMENTAL SOURCE: Nucleotide synthesis 

FEATURES: from 8 to 181 bp mature peptide 
20 PROPERTIES: Coding gene for stabilizer peptide 

CGATTCC ATG GCG CCT ACT TCA AGT TCT ACA AAG AAA ACA 4 
Met Ala Pro Thr Ser Ser Ser Thr Lys Lys Thr 
25 5 10 



30 



35 



40 



45 



50 



CAG CTA CAA CTG GAG CAT TTA CTG CTG GAT TTA CAG ATG 7 9 
Gin Leu GLn Leu Glu His Leu Leu Leu Asp Leu Gin Met 
15 20 

ATT TTG AAT GGA ATT AAT AAT TAC AAG AAT CCC AAA CTC 118 
He Leu Asn Gly He Asn Asn Tyr Lys Asn Pro Lys Leu 
25 30 35 

ACC AGG ATG CTC ACA TTT AAG TTT TAC ATG CCC AAG AAG 157 
Thr Arg Met Leu Thr Phe Lys Phe Tyr Met Pro Lys Lys 
40 45 50 

GCC ACA GAA CTG AAA CAT CTC CAG TGTCTAGAGC TAG 194 
Ala Thr Glu Leu Lys His Leu Gin 

55 
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.qF.O TP NO: 2 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 17 base pairs 
MOLECULE TYPE: DNA 

TCGAACTAGT TAACTAG 17 



.qy.n ID NO: 3 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 27 base pairs 
MOLECULE TYPE: DNA 

CATCTAGACA TGCAT^TGTT AAAAGAA 27 
5;F.0 TP NO: 4 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 26 base pairs 
MOLECULE TYPE: DNA 

CGGATCCTAT CAGTTAGCTG GATTTG 26 
<?E0 IP NQ:5 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 268 base pairs 
MOLECULE TYPE: DNA 



GGGGAAGCTC AACAACACTT GTTGCAATTG 
ATTGCAAGCT AGAATCTTGG CTGTTGAAAG 
TGTTGGGTAT CTGGGGTTGT TCTGGTAAGT 
CAATGGAACG CTTCTTGGTC TAACAAGTCT 
CATGACTTGG ATGGAATGGG ACAGAGAAAT 
AATAGGGATC CGTCGACC 



ACTGTTTGGG GTATCAAGCA 50 
ATACTTGAAG GACCAACAAT 100 
TGATCTGTAC TACTGCTGTT 150 
TTGGAACAAA TCTGGAACAA 200 
CAACAACTAC ACTTCTTTGT 250 

Zoo 
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SF.O TP NO: 6 



5 



SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 321 base pairs 
MOLECULE TYPE: DNA 



CTAGAAGTTC 


AGCAACAACA 


ACAGTTATTG 


GACGTAGTTA 


AGAGACAACA 


50 


GGAACTATTG AGACTAACCG 


TTTGGGGAAC 


CAAGAACTTA 


CAGGCAAGAG 


100 


TAACTGCTAT 


CGAGAAATAT 


CTACAAGACC 


AGGCTCGTCT 


AAATTCATGG 


150 


GGATGTGCAT 


TCCGTCAGGT 


ATGTCACACT 


ACCGTACCAT 


GGGTTAATGA 


200 


TTCTTTAGCT 


CCAGACTGGG 


ATAATATGAC 


CTGGCAGGAG 


TGGGAAAAGC 


250 


AAGTACGTTA 


CTTAGAGGCT 


AACATTTCAA 


AAAGTTTGGA 


GCAGGCACAG 


300 


ATCCAGGGTA 


CTAATAGCTA 


G 






321 



c^jrn TP MO: 7 

SEQUENCE TYPE: N-terminal fragment of human interleukin 2 
SEQUENCE LENGTH: 58 amino acids 



MOLECULE TYPE: Peptide 
ORIGINAL SOURCE ORGANISM: Human interleukin 2 
IMMEDIATE EXPERIMENTAL SOURCE: Nucleotide synthesis 
^® FEATURES: from 1 to 58 amino acid mature peptide 
PROPERTIES: Stabilizer peptide 

40 Met Ala Pro Thr Ser Ser Ser Thr Lys Lys Thr Gin 



30 




45 



Leu GLn Leu 



lie Leu Asn 



25 



Glu His Leu Leu Leu Asp Leu Gin Met 

20 

Gly lie Asn Asn Tyr Lys Asn Pro Lys 
30 35 



50 



Lys Lys Ala 
50 



Leu Thr Arg 



Met Leu Thr Phe Lys Phe Tyr Met Pro 

40 45 
Thr Glu Leu Lys His Leu Gin 



55 
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1. A method for the expression of heterologous proteins produced in fused form in E. in which a stabiliser 
sequence is used for expression of the heterologous proteins, consisting of an N-terminal fragment of 
human interleukine-2. characterised in that said sequence codes for not more than the first 58 amino acids 
of this protein, to which is fused the sequence of the heterologous protein to be expressed. 

2. A method according to claim 1. characterised in that the amino acid sequence of the stabiliser peptide 
corresponds to: 

10 20 30 40 50 

MAPTSSSTKK TQLQLEHLLL DLQMILNGIN NYKNPKLTRM LTFKFYMPKK ATELKHLQ 

3. A method according to claim 1. characterised in that the heterologous proteins which are expressed 
con-espond to the nuclear protein (gag24) and the transmembraneous protein (gp41) belonging to human 
Immunodeficiency vinjs HIV-1 and the transmembraneous protein gp36 belonging to human im- 
munodeficiency virus HIV-2. 

4. Expression vector pFP-15, characterised in that it contains the stabiliser sequence which codes for tee 
first 58 amino adds of human interleukine-2 under the tryptophan promoter of E . ooli with the signal for 
termination of bacteriophage T4 and the gene for ampicillin resistance, and contains the restriction sites 
Xba I. Bam HI and Xho I for fusion of the heterologous protein which is to be expressed, 

sTVeclori" VIHCA.^VIHTA-1 and VIHTA-2 derived from pFP-15, characterised in that they contain gene 
sequences coding for protein gag24. for a fragment of protein gp4l (both of HIV-1) and for a fragment of 
protein gp36 of HiV-2 respectively, which are coupled to the stabiliser sequence of vector pFP-15 using the 
restriction sites present therein. 

6. Recombinant strains HB24. W41 and C36. characterised in that they are obtained as a result of 
transformation of E . coli strains K-12 HB-101. W-3110 and C-600 with the vectors VIHCA, VIHTA-1 and 
VIHTA-2 respectively and that they express high levels of the antigenic HIV proteins in insoluble form, 

7. Fusion proteins obtained according to the preceding claims, characterised in that they are composed of a 
peptide which includes the first 58 amino acids belonging to the N-terminal end of human interleukine-2. 
which is fused to a heterologous protein. 

8. Fusion proteins according to claim 7. characterised in that said heterologous protein corresponds to the 
protein gag24. protein gp41 (both of HIV-1) or protein gp36 of HIV-2. 

9. Use of the fusion proteins obtained according to the preceding claims, characterised in that they can be 
used in diagnostic methods for the detection of human or animal antibodies. 
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CGATTCCATG GCGCCTACTT CAAGOTCTAC AAAGAAAACA CAGCTACSAAC TGGAGCATTT 

ACTGCTGGAT TTACAGAT6A TTTTGAATGG AATTAATAAT TACAA6AATC CCAAACTCAC 

CAGOATGCTC ACATTTAAGT TTTACATGCC CAAGAAGGCC ACAGAACTGA AACATCTCCA 
181 

GTQTCTAGAG/ ctag 

-Extension CG at 5* end sticky Cla I 

-Extension CTAG at 5* end of the complementary strand sticky BamH I 
-ATG initiation of transcription 

FIG.1 
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FIG. 2 
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5* TCGAACTAGTTAACTAO 3' 



FIG. 3 



1 

66G6AA6CTC A&CAACACTT 
61 

A6AATCTT66 CTGTTGAAA6 
121 

TCTGQTAAGT TGATCT6TAC 
181 

TTGGAACAAA TCTGGAACAA 
241 

ACTTCTTTGT AATAGGGATC 

BaoiH 



GTTGCAATTG ACTGTTTGGO 
ATACTTGAAG GACCAACAAT 
TACTGCTGTT CAATGGAACG 
CATGACTT6G ATGGAATG6G 

CGTCGACC 

I 



GTATCAAGCA ATTOCAAGCT 
TGTTGGGTAT CTG666TTGT 
CTTCTTGGTC TAACAAGTCT 
ACAGA6AAAT CAACAACTAC 



FIG.5 
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FIG.6 
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1 

CTAGAA6TTC A6CAACAACA ACAGTTATTO 
61 

AGACTAACCG TTTGGGGAAC CAAGAACTTA 
121 

CTACAA6ACC A66CTCGTCT AAATTCATGG 
181 

ACCGTACCAT 6GGTTAAT6A TTCTTTAGCT 
241 

TGG6AAAAGC AAGTACGTTA CTTAGAGGCT 
301 

ATCCAGGGTA CTAATAG/Ctag 



GACGTAGTTA AGAGACAACA 6GAACTATTG 
CAGGCAAGAG TAACTGCTAT CGAGAAATAT 
6GATGTGCAT TCC6TCAQGT ATGTCACACT 
CCA6ACTGGG ATAATATGAC CTGGCAGGAG 
AACATTTCAA AAAGTTTGGA GCAGGCACAG 



-Extension CTAG at 5' end sticky Xba I 

-Extension CTAG at 5' end of the complementary strand sticky BamH I 



FIG. 7 



16 



EP 0 416 673 A1 



Xba I 




FIG. 8 
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